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^ ■ Likelihood ratio tests are performed for the hypothesis that charged-particle multiplicities 

' measured in proton-(anti)proton collisions at = 0.9 and 2.36 TeV are distributed 

QQ ' according to the negative binomial form. Results indicate that the hypothesis should be 

, rejected in the all cases of ALICE-LHC measurements in the limited pseudo-rapidity 

1 windows, whereas should be accepted in the corresponding cases of UA5 data. Possible 

explanations of that and of the disagreement with the least-squares fitting method are 
given. 
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' 1. Introduction 



The UA5 Collaboration noticed for the first time that charged-particle multiplicity 
distributions measured in high energy proton- (anti)proton collisions in limited in- 
tervals of pseudo-rapidity have the negative binomial form ^ . In the present paper 
this observation will be verified for the collisions at y/s ~ 0.9 and 2.36 TeV per- 
formed by UAsl^land ALICE Collaborations iSl. Only non-single diffractive (NSD) 
events will be considered because such a case was analyzed with this respect by 
both Collaborations. In fact, the author investigated ALICE inelastic events also 
(including the case of \/s = 7 TeV^^, but all fits were entirely unacceptable. 
The Negative Binomial Distribution (NED) is defined as 

Pin;p,k) = Hk + l)ik + 2)...ik + n-l)^^^^^^^, 

where n = 0, 1, 2, < p < 1 and fc is a positive real number. In the application to 
high energy physics n has the meaning of the number of charged particles detected 
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in an event. The expected value n and variance 



V{n) are expressed 



as: 



k{l-p) 



V{n) 



fc(l-p) 

p2 



(2) 



n — 



P 



In this analysis the hypothesis that the cliarged-particle multiplicities measured 
in high energy p — p{p) collisions are distributed according to the NBD is verified 
with the use of the maximum likelihood method (ML) and the likelihood ratio test. 
More details of this approach can be found in Refs. [IHTl 

There are two crucial reasons for this approach: 

(i) The fitted quantity is a probability distribution function (p.d.f.), so the most 
natural way is to use the ML method, where the likelihood function is con- 
structed directly from the tested p.d.f.. But more important is that because of 
Wilks's theorem (see Appendix B) one can easily define a statistic, the distri- 
bution of which converges to a distribution as the number of measurements 
goes to infinity. Thus for the large sample the goodness-of-fit can be expressed 
as a p-value computed with the corresponding distribution. 

(ii) The most commonly used method, the least-squares method (LS) (called also 
minimization), has the disadvantage of providing only the qualitative measure 
of the significance of the fit, in general. Only if observables are represented 
by Gaussian random variables with known variances, the conclusion about the 
goodness-of-fit equivalent to that mentioned in the point (i) can be derived 

It is worth noting that the ML method with binned data and Poisson fluctuations 
within a bin was already applied to fitting multiplicity distributions to the NBD 
but at much lower energies (E-802 Collaboration'^). 

2. The maLximum likelihood method 

The number of charged particles Nch is assumed to be a random variable with the 
p.d.f. given by Eq. ([T]). Each event is treated as an independent observation of Nch 
and a set of a given class of events is a sample. For N events in the class there are 
N measurements of Net, say X = {Xi, X2, Xjv}. Some of these measurements 
can be equal, i.e. Xi = Xj for i ^ j can happen. The whole population consists 
of all possible events with the measurements of 0, 1, 2,... charged particles and by 
definition is infinite 0. 

For the class of events one can defined the likelihood function 

^Here, these quantities are distinguished from the experimentally measured the average charged 
particle multiplicity {N^^) and the variance . 

^ Precisely, because of the energy conservation the number of produced charged particles is limited 
but the number of collisions is not. 
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N 

L(X|p,fc) = [] P(X,;p,fc) , (3) 

where P{Xf,p,k) is the NBD, Eq. ([J). 

The values p and k for which L(X | p, k) has its maximum are the maximum 
likelihood estimates of parameters p and k. This is equivalent to the maximization 
of the log-likelihood function 

N 

\nL{y.\p,k) = Y,\nP{X,-p,k) . (4) 
Thus the values p and k are the solutions of the equations: 
d ^8 

-lnL(X|p,fc) = ^ -lnP(X,;p,fc)=0, 
3 ^ 1 

— \nL{X\p,k)^J2 ^lnP(^,;P,fc)=0. (5) 

It can be proven that one of the necessary conditions for the existence of the 
maximum is (see Appendix A for details): 



n = , (6) 

i.e. the distribution average has to be equal to the experimental average. 



3. Likelihood ratio test 

Let divide the sample defined in Sect. [2] into m bins characterized by Yi - the 
number of measured charged particles El and Ui - the number of entries in the zth 
bin, N = jy^i "-2 (details of the theoretical framework of this Section can be found 
in Refs.[5]-[7]). Then the expectation value of the number of events in the ith bin can 
be written as 



Vi{vtot,P,k) = vtot ■ P{yi:P,k) , (7) 

where vtot is the expected number of all events in the sample, vtot — X^I^i ^i- This 
is because one can treat the number of events in the sample N also as a random 
variable with its own distribution - Poisson one. Generally, the whole histogram can 

'^Now Yi ^ Yj for i j and i,j = 1, 2, ...,m. 
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be treated as one measurement of m-dimensional random vector n = (ni, 
which has a multinomial distributio n, so the joint p.d.f. for the measurement of N 



and n can be converted to the form^^Zl 



/(n; 1^1,..., Um) = TT ^ "^^P (~^*) ■ (^) 



n. 
i=i 



Since now /(n; Vi, Vm) is the p.d.f. for one measurement, / is also the likelihood 
function 



L{n I vi,...,v^) = /(n;j/i, ...,!/„) . (9) 
With the use of Eq. ([7]) the corresponding likelihood function can be written as 

L(n I vtot,P,k) = i(n | vi{utot,P,k), ...,Vm{vtot,P,k)) . (10) 
Then the likelihood ratio is defined as 



L(n I i>tot,p,fc) ^ L (n I i>tot, j3,fc) ^^^^ 
L{n\i>i,...,i'rn) L{n\ni,...,nm) 

where i>tot: P and k are the ML estimates of Vtot, P and k with the likelihood function 
given by Eq. pUj) and t'i = rii, i = 1, 2, ...m are the ML estimates of Vi treated as 
free parameters. Note that since the denominator in Eq. (jlip does not depend on 
parameters, the log-ratio defined as 

, w ,^ , L{n\ptot,p,k) 

In A[vtot,P, k) = In ■ 



L{n I ni, ...,n„ 



( rii In — + - Tii j 

1 V * / 



= -i^tot + AT _ ^ n^ln — , (12) 

i=i 

where Vi are expressed by Eq. can be used to find the ML estimates of vtot, P 
and k. Further, the statistic given by 



m 

X^ = -21nA = 2^ fnan^ + i>»-7i» ) (13) 



1=1 



approaches the distribution asymptotically, i.e. as the number of measurements, 
here the number of events iV, goes to infinity (the consequence of the Wilks's the- 
orem, see Appendix B). The values Oi are the estimates of Vi given by 
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h = hof P{Y^■,p,k) (14) 

and if one assumes that vtot does not depend on p and k then vtot — N . For such a 
case 

m m 

^z>, = ^n, (15) 

1=1 1=1 

and Eq. ([T^ becomes 



X^(p,A;) = -21nA = 2 V njn— . (16) 

i—i 

Also then one can just put vtot = N and Eq. (|12p can be rewritten as 
In A(p, k) 

m m 
= iV-lnA^-^ ni\nni + ^ Uiln P{Yi;p, k) 



1=1 1=1 
nan^+iV^ ^lnP(r,;p,fc) 

m rn 



= _Ar \ Ini^- + iV > ^ Pr lnF(y,;p, fc), (17) 



1=1 i=l 



with the term depending on p and fc the same as Eq. (jA.4[) and P/^ = rti/N . 
Therefore conclusions of Appendix A holds here, i.e. the necessary conditions for 
the existence of the maximum is fi = (Nch), Eq. (|A.10[) is the equation which 
determines k and p is obtained with the help of Eq. (jA.9|) . Note that the maximum 
of In A is the minimum of = —2 In A, so from Eqs. and ([T7| one arrives at 



xLn = -2Nj:Prin^^^^. (18) 

i=l « 

In fact, the method just described assumes that the sum of Pi{p,k) = P{Yi;p,k) 
over all bins equals 1. But only the infinite sum of P{n]p,k) is 1. However the 
measured values of Ym are big enough (of the order of 20 at least for all considered 
cases) so in the vicinity of p and k the sum of P{n]p, k) up to n = Ym equals 1 
approximately (see the seventh column in Table [1]). Nevertheless, to calculate Xmm: 
Eq. (|18p . P{Yi;p, k) were normalized appropriately and these results are listed in the 
fifth column of Tables [iJlSl Another way to treat this problem is to create arbitrary 
the (to + l)st bin for all possible n > Ym and with P^+i — 0. Bins with Pf^ — 
{rii — equivalently) do not contribute in Eq. (|18p (see Ref.i5|. In practice, it means 
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that Xmin would be calculated also from Eq. ([TSl) but without the normalization. It 
has turned out that that way leads to much greater values of Xmin- 

4. Results and discussion 

The method described in Sections [5] and [3] requires that all bins in a given data 
set have the width equal to 1, so as the experimental probability Pf^ to measure a 
signal in the ith bin was equivalent to the probability of the measurement of (i — 1) 
charged particles (the first bin is the bin of charged particles detected). This is 
fulfilled for all bins of the considered data sets except the ends of their tails. In 
these tails the measured values of P/^ have been uniformly distributed over the bin 
range so as the method could be applied directly. If the bin width is not significantly 
greater than 1 then this approximation should not change substantially the value 
of Xmin given by Eq. (fTS)) because in the most cases Pl^ at tails are two orders 
smaller than in the main part of distributions. Also errors in tails are bigger, in the 
range 10 — 50%, increasing with i. 

Since the test statistic —2 In A has a distribution approximately in the large 
sample limit, it can be used as a test of the goodness-of-fit. The result of the test 
is given by the so-called p-value which is the probability of obtaining the value of 
the statistic, Eq. (fT3)) . equal to or greater then the value just obtained by the ML 
method for the present data set, when repeating the whole experiment many times: 



where f{z; rid) is the p.d.f. and Ud the number of degrees of freedom, Ud = m — 2 
here. 

The results of the analysis are presented in Table [TJ Note that for UA5 cases 
two possibilities of the corrected number of events are listed. This is because only 
the measured number of events, 6839, is given in Ref. [2] However, the fits have been 
done to the corrected distributions, so also the corrected number of events should 
be put into Eq. ([T8|) . The number have been estimated in the following way: in Fig. 4 
of Ref. [2l the mean of the observed distribution versus the corrected (true) number 
of particles is plotted, the curve is a straight line roughly with the tangent equal 
to ^ 0.8, so one can guess that the efficiency is also about 80%. Just to check how 
results are stable with respect to a change in the number of events, the case with 
70% efficiency has been also calculated. As one can see, for all ALICE cases the 
hypothesis in question should be rejected, whereas for the listed UA5 cases should 
be accepted. But it was claimed that charged-particle multiplicities measured in 
the limited pseudo-rapi dity w indows by the ALICE Collaboration are distributed 
according to the NBD I 3 | 4 | 12 | jjowever that conclusion was the result of the 
minimization (the LS method). Therefore it seems to be reasonable to check what 
are the values of the LS x^ function at the ML estimators listed in the third and 




(19) 
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Table 1. Results of fitting multiplicity distributions for the NSD events measured in p — p (UA5) ^ and p — p (ALICE) ^ 
collisions. The ALICE numbers of events are from Ref. 9. The distributions have been modified in the tails so as all bins have 
the width 1, see the text for explanations. 

















xis/"d 


with errors: 


Experiment 


N 


k 


P 


(n^) 


p- value 

[%] 


^ZP^{P,k) 


quadrature 
sum 


sum 


statistical 
only 


UA5 

= 0.9 TeV 
1 ri |< 0.5 


8550.0 
(80 % eff.) 


1.5574 
±0.0365 


0.3012 
±0.0054 


0.339 
10.16 

(30) 


99.97 


0.99996 


0.375 


na 


na 


UA5 

= 0.9 TeV 
1 V \< 0.5 


10000.0 

(70 % cff.) 


1.5574 
±0.0337 


0.3012 
±0.0050 


0.396 
11.88 

(30) 


99.87 


0.99996 


0.375 


na 


na 


ALICE 
= 0.9 TeV 
1 1^ U.O 


149663.16 


1.3764 
±0.0076 


0.2767 
±0.0012 


14.155 
353.88 
C95~l 





0.99960 


1.116 


0.576 


3.089 


ALICE 
= 0.9 TeV 

1 n 1 ^ 1 

1 r; 1^ i.u 


128476.45 


1.4316 
±0.0070 


0.1625 
±0.0008 


37.761 
1548.21 

C41 "1 





0.99865 


1.886 


1.034 


11.51 


ALICE 
y/s = 0.9 TeV 
1 V l< 1-3 


60142.77 


1.4955 
±0.0102 


0.1332 
±0.0009 


22.051 
1168.69 
(53) 





0.99876 


2.993 


1.671 


15.31 


UA5 

= 0.9 TeV 

1 n |< 1.5 


8550.0 
(80 % eff.) 


1.7987 
±0.0319 


0.1385 
±0.0024 


0.812 
60.12 
(74) 


87.81 


0.99991 


0.487 


na 


na 


UA5 
y/s = 0.9 TeV 
|»?|< 1.5 


10000.0 

(70 % eff.) 


1.7987 
±0.0295 


0.1385 
±0.0022 


0.950 
70.31 
(74) 


59.99 


0.99991 


0.487 


na 


na 


ALICE 
^ = 2.36 TeV 
1 ri |< 0.5 


38970.79 


1.1778 
±0.0115 


0.2084 
±0.0018 


6.266 
194.26 
(31) 





0.99930 


0.888 


0.501 


3.592 


ALICE 
= 2.36 TeV 
1 V \< 1-0 


37883.99 


1.2139 
±0.0103 


0.1180 
±0.0010 


17.416 
853.37 
(49) 





0.99726 


2.209 


1.312 


17.73 


ALICE 
= 2.36 TeV 
1 V \< 1-3 


22189.40 


1.2123 
±0.0129 


0.0927 
±0.0010 


15.561 
949.22 
(61) 





0.99644 


4.0557 


2.4537 


34.40 



fourth columns of Tabled] For the sample described in Sect. [3] one can define the 
LS function as: 
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Table 2. Results of testing the NBD for the original data sets of the NSD events measured in p — p (UA5) ^ and p — p 
(ALICE) ^ collisions. The ALICE numbers of events are from Ref. 9. The values of k and p are taken from Tabic [T] 



















errors: 




Experiment 
V~s 


N 


k 


P 




p- value 

[%] 


quadrature sum statistical 
sum only 




UA5 
= 0.9 TeV 
1 7? |< 0.5 


8550.0 

(80 % eff.) 


1.5574 
±0.0365 


0.3012 
±0.0054 


0.211 
4.859 
(23) 


99.998 


0.072 


na 


na 


0.203 


UA5 
Vi = 0.9 TeV 
1 V l< 0.5 


10000.0 
(70 % eff.) 


1.5574 
±0.0337 


0.3012 
±0.0050 


0.247 
5.683 
(23) 


99.991 


0.072 


na 


na 


0.237 


ALICE 
Vi = 0.9 TeV 
1 ri \< 0.5 


149663.16 


1.3764 
±0.0076 


0.2767 
±0.0012 


14.498 
347.95 

(24) 





0.728 


0.381 


2.458 


15.107 


ALICE 
Vi = 0.9 TeV 
\v\< 1-0 


128476.45 


1.4316 
±0.0070 


0.1625 
±0.0008 


36.855 
1547.91 
(42) 





1.718 


0.948 


11.010 


38.017 


ALICE 
Vi = 0.9 TeV 
1 r,|< L3 


60142.77 


1.4955 
±0.0102 


0.1332 
±0.0009 


24.323 
1167.51 

(48) 





2.213 


1.276 


15.201 


25.771 


UA5 
v/i = 0.9 TcV 
\V\< 1-5 


8550.0 
(80 % eff.) 


1.7987 
±0.0319 


0.1385 
±0.0024 


1.099 
57.16 
(52) 


28.94 


0.362 


na 


na 


1.14 


UA5 
= 0.9 TeV 
|7y|< 1.5 


10000.0 

(70 % eff.) 


1.7987 
±0.0295 


0.1385 
±0.0022 


1.286 
66.85 
(52) 


8.06 


0.362 


na 


na 


1.33 


ALICE 
^ = 2.36 TeV 
1 V l< 0.5 


38970.79 


1.1778 
±0.0115 


0.2084 
±0.0018 


7.030 
189.82 
(27) 





0.761 


0.428 


3.805 


7.465 


ALICE 
= 2.36 TeV 
\v\< 1-0 


37883.99 


1.2139 
±0.0103 


0.1180 
±0.0010 


18.535 
852.59 
(46) 





2.288 


1.362 


18.802 


20.282 


ALICE 
= 2.36 TeV 
\V\< 1-3 


22189.40 


1.2123 
±0.0129 


0.0927 
±0.0010 


18.233 
948.11 
(52) 





4.245 


2.599 


39.647 


19.980 



2 ^ (pr-p(y,;p,fc))^ 

XLsfefc) = 2^ ^ ' (20) 

2—1 

where err^ is the uncertainty of the ^th measurement. Here this function is not 
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minimized with respect to p and k as in the LS method but is calculated at ML 
estimates of p and k, i.e. at p and k. One can see from the eight and ninth columns of 
Table[T]that Xj^g/nd values are significant for the ALICE narrowest pseudo-rapidity 
windows, what agrees with the results of Ref. |12, 

Since the determination of k and p has been done for the distributions modified in 
their tails, as it has been just explained, one should check what values of and xis 
are at k and p for the original data sets. It means that if the ith bin width is greater 
than 1, instead of P{Yi;p, k) in Eq. (|18|) the appropriate sum ^ P{n;p, k) over 
n G bin i is taken. The results of the check are presented in Table [2] Qualitatively 
the results are the same as in Table [l] only slight differences in numbers can be 
noticed except the UA5 cases (for | ?7 |< 0.5 has decreased more than 2 times, but 
the change is in the good direction) . This is because the maximal width of a tail bin 
is 2 for aU ALICE cases, but is 8 and 17 for UA5 windows | r/ |< 0.5 and | r] \< 1.5, 
respectively. Of course, the assumption of the uniform distribution inside a wider 
bin causes greater discrepancies. Nevertheless, the results of the test for both UA5 
cases are positive even if (fc, p) is not the maximum of the exact likelihood function 
(in fact, values of k are the same as those obtained by UA5 Collaboration in Ref. [2|). 
This is guaranteed by the Wilks's theorem (see Appendix B), which allows for the 
test of a single point in the parameter space. Then the tested point might not be 
the best estimate of the true value but the hypothesis in question becomes the 
hypothesis only about a particular distribution (a simple hypothesis). This is also 
the reason why rid — m in Table [5] In terms of rigorous statistics single points are 
tested in there. 

In all ALICE cases values listed in the fifth column of Table[2]are only slightly 
smaller than corresponding ones from Table [TJ For | ij |< 0.5 the decrease is about 
2%, for other cases is less than 0.1%. Also x^ /ud values are much greater than 1. 
Therefore it is reasonable to recognize k and p determined for modified data sets as 
a good approximations of the ML estimators. Thus the hypothesis about the NBD 
should be rejected on the basis of obtained values of x^ jud and p- values. 

One can also compare x^ jud with xis/^d calculated for the original data sets 
and the same k and p. The results are listed in four last columns of Table [2] for 
various treatment of errors. Note that for UA5 conclusions from both statistics are 
exactly the same. In the ALICE both cases of the window | 77 |< 0.5, xis/'^d < 1 is 
acceptable for errors expressed as the quadrature sum of statistical and systematical 
components and is smaller than the corresponding values in Table[TJ In other ALICE 
cases xis/'^d substantially greater than 1 for the same treatment of errors. This is 
in the full agreement with the results of Ref. [12l One can also check what xis / "-d is 
if only statistical errors are taken into account. The results are listed in the next to 
last column of Tabled For all ALICE cases the values are much greater than 1. This 
means that acceptable xis/^d '^as obtained only because of significant systematic 
errors of ALICE measurements. The word "significant" is subjective, here means 
" significant with respect to the sample size" , not to the value of Pf^ . 

The crucial question is now why the conclusions from x^ and xis test statistics 
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Table 3. Results of testing the NBD for the original data sets of the NSD events measured in p — p (UA5) ^ and p — p 
(ALICE) 3 collisions at = 0.9 TeV. The values of k and p are taken from Tabic [T] but the ALICE numbers of events 
have been changed arbitrarily to the UA5 number of events. 

















xis/^d with errors: 




Experiment 


IN 


i- 


P 


X 


p- value 

[%] 


quadrature sum 
sum 


statistical 
only 




UA5 
Vi = 0.9 TeV 
1 ri |< 0.5 


8550.0 
(80 % eff.) 


1.5574 
±0.0365 


0.3012 
±0.0054 


0.211 
4.859 
(23) 


99.998 


0.072 


na 


na 


0.203 


ALICE 
V3 = 0.9 TeV 
1 r/ |< 0.5 


8550.0 


1.3764 
±0.0318 


0.2767 
±0.0051 


0.828 
19.88 
(24) 


70.37 


0.728 


0.381 


2.458 


0.863 


ALICE 
V3 = 0.9 TeV 
1 V \< 1-0 


8550.0 


1.4316 
±0.0272 


0.1625 
±0.0029 


2.453 
103.01 

(42) 


5 -10-5 


1.718 


0.948 


11.010 


2.530 


ALICE 
yi = 0.9 TeV 
1 V l< 1-3 


8550.0 


1.4955 
±0.0271 


0.1332 
±0.0024 


3.458 
165.97 

(48) 


7 -10-" 


2.213 


1.276 


15.201 


3.664 


UA5 
= 0.9 TeV 
1 r, |< 1.5 


8550.0 
(80 % eff.) 


1.7987 
±0.0319 


0.1385 
±0.0024 


1.099 
57.16 

(52) 


28.94 


0.362 


na 


na 


1.14 



are the same for UA5 data but entirely opposite for ALICE measurements? The 
main difference between both statistics is that depends explicitly on the number 
of events but xis does not. On opposite, does not depend on the actual errors 
but does. In fact, statistic implicitly assumes errors of the type what 
is the straightforward result of the form of the likelihood function, Eqs. ([8]) and ([9]), 
namely the product of Poisson distributions. This is revealed when one compare 
X^/^d and x\sl''^d '^ith errors ^ ^/nl (the fifth and last column in Table [2]). The 
values are practically the same. 

To find out what is the reason for the above-mentioned disagreement the calcu- 
lations of Table [2] have been repeated for ALICE measurements at = 0.9 TeV 
but with the arbitrary assumption that all cases have the same number of events 
as UA5 ones. The results are listed in Table |3l One can see that now there is full 
agreement between and x|g test statistic results for all ALICE cases. This means 
that the accuracy with which experimental distributions approximate the NBD has 
not increased in ALICE data even though the sample sizes are one order greater. 
But the accuracy should increase with the sample size because if the hypothesis is 
true the postulated form of distribution is exact for the whole population. So with 
the growing number of events, the experimental distribution should be closer to the 
postulated one. This is also seen in the form of Xmim Eq. ([T8| . where the linear 
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dependence on N is explicit. To keep Xmin least constant when N (the sample 
size) is growing the relative differences between P{Yi) and P^^ have to decrease. 

5. Conclusions 

The main conclusion is that the hypothesis of the NBD of charged-particle multi- 
plicities measured by the ALICE Collaboration in proton-proton collisions at y/s = 
0.9 and 2.36 TeV should be rejected for all pseudo-rapidity window classes. This is 
the result of likelihood ratio tests performed for the corresponding data samples. 
The significant systematic errors are the reasons for acceptable values of the least 
squares test statistic for the narrowest pseudo-rapidity window measurements. 

The second conclusion is that the size of "proper" errors {i.e. not too big and not 
too small, both extremes cause the false inference from 'Xj^sl'^^d values) is somehow 
related to the sample size. Here, for instance, errors of the type y/nl could be "a 
frame of reference" as it has been revealed from the results gathered in Tables [5] 
and |3l This is connected with the meaning of the formulation of a hypothesis. If 
the hypothesis is true, it means that the form of a distribution postulated by this 
hypothesis is exact for the whole population. Thus for the very large samples (as 
in all ALICE cases) the measured distribution should be very close to that postu- 
lated. The performed analysis has shown that the ALICE experimental errors are 
much bigger than the acceptable discrepancies (acceptable for these sample sizes). 
Therefore and Xl^ test statistics give the opposite answers in the narrowest 
pseudo-rapidity windows of the ALICE measurements. For the UA5 sample sizes, 
which are much smaller than the ALICE ones, the experimental errors have turned 
out to be of the order of acceptable discrepancies, so both test statistics give the 
same answer. 
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Appendix A. 

The sample defined in Sect. [5] can be divided into m bins with the different value 
of measured Nch in each bin. Let n,; be the number of events in the zth bin, i.e. 
events with the same measured value of Nch, say Yi. Then the number of events in 
the sample equals 



m 




(A.l) 



i=l 
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Dividing by N one can obtain the condition for experimental probabilities (frequen- 
cies) Pf^: 



i=l 



Now the likelihood function, Eq. ([3]), can be rewritten as 



N 



L{X\p,k) = l[ PiXf,p,k) = ll P{Y,;p,kr 



L{Y\p,k) = 



H PiYf,p,k)^ 



N 



N 



and the corresponding log-likelihood function reads 



lnL(Y|p,fc) = 7V^ Pr \nP{Yf,p,k) 



i=l 



Since the logarithm of the NBD is given by 



lnP(n;p, k) 



In (fc + j — 1) + n In (1 — p) + fc Inp — In (n!) 



(A.2) 



(A.3) 



(A.4) 



(A.5) 



the necessary conditions for the existence of the maximum, Eqs. ([2]), have the fol- 
lowing form: 



^lnL{Y\p,k) 



1-p P 



N 
N 



1=1 

l-p p 



= 



(A.' 



-\nL{Y\p,k) 
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r Yi 



NY^pr E 

=1 '-j=i 

m Yi 



1 



N 



k + j-1 
1 

k + j-1 



hip 
Inp 



0, 



(A.7) 



where the sum over is if 1^ = 0. 

From Eqs. (|A.6|) and ([2|) one can obtain: 



, k{l-p) 



(A., 



Expressing p as a function of k and (Nch) 



- = ^ + 1, (A.9) 
p k 

and substituting it to Eq. (jA.7[) the equation which determines k is obtained: 



— lnL{Y\p,k) 



N 



Yi 



hi 1 



{Nch 



. 



(A.IO) 



The above equation can be solved numerically. Having obtained k and substituting 
it into Eq. (|X9|) p is derived. 

Appendix B. Wilks's theorem 

Let X be a random variable with p.d.f /(X, 6), which depends on parameters 9 = 
{6'i, 6*2, ■■■,0d} G Q, where a parameter space 9 is an open set in R'^. For the set 
of N independent observations of X, X = {ATi, X2,...,Xjv}, one can defined the 
likelihood function 



N 



L(x|0) = n f{x,-e) 



(B.l) 



Now consider Hq, a fc-dimensional subset of 8, fc < d. Then the maximum likelihood 
ratio can be defined as 



A = 



maxgggp £(X | 
maxeee L{X. \ 



(B.2) 
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This is a statistic because it does not depend on parameters 9 no more, in the 
numerator and the denominator there are hkehhood function values at the ML 
estimators of parameters 9 with respect to sets Hq and Q, respectively. 

The Wilks's theorem says that under certain regularity conditions if the hypoth- 
esis Hq is true {i.e. it is true that 9 € Hq), then the distribution of the statistic 
—2 In A converges to a distribution with d — k degrees of freedom as N — > oo 
ISHni. The proof can be found in Ref. [TTl Note that /c = is possible, so one point 
in the parameter space (one value of the parameter) can be tested as well. 
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